
Breast cancer prognosis by combinatorial analysis of gene expression data

Author: David E Axelrod
Gabriela Alexe
Irina I Lozina
Michael Reiss
Peter L Hammer
Sorin Alexe
Tibérius O Bonates
Publication venue: Springer Nature
Publication date: 01/01/2006

INTRODUCTION: The potential of applying data analysis tools to microarray data for diagnosis and prognosis is illustrated on the recent breast cancer dataset of van 't Veer and coworkers. We re-examine that dataset using the novel technique of logical analysis of data (LAD), with the double objective of discovering patterns characteristic of cases with good or poor outcome and using them for accurate and justifiable predictions; and deriving novel information about the role of genes, the existence of special classes of cases, and other factors.
METHOD: Data were analyzed using the combinatorics and optimization-based method of LAD, recently shown to provide highly accurate diagnostic and prognostic systems in cardiology, cancer proteomics, hematology, pulmonology, and other disciplines.
RESULTS: LAD identified a subset of 17 of the 25,000 genes, capable of fully distinguishing between patients with poor and good prognoses. An extensive list of 'patterns' or 'combinatorial biomarkers' (that is, combinations of genes and limitations on their expression levels) was generated, and 40 patterns were used to create a prognostic system, shown to have 100% and 92.9% weighted accuracy on the training and test sets, respectively. The prognostic system uses fewer genes than other methods, and its accuracy is similar to or better than that reported in other studies. Out of the 17 genes identified by LAD, three were shown to play a significant role in determining poor prognosis and five in determining good prognosis. Two new classes of patients (described by similar sets of covering patterns, gene expression ranges, and clinical features) were discovered. As a by-product of the study, it is shown that the training and the test sets of van 't Veer have differing characteristics.
CONCLUSION: The study shows that LAD provides an accurate and fully explanatory prognostic system for breast cancer using genomic data (that is, a system that, in addition to predicting good or poor prognosis, provides an individualized explanation of the reasons for that prognosis for each patient). Moreover, the LAD model provides valuable insights into the roles of individual and combinatorial biomarkers, allows the discovery of new classes of patients, and generates a vast library of biomedical research hypotheses.
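
The abstract describes a pattern as a combination of genes with limitations on their expression levels, and a prognostic system that also explains each prediction. The following is a minimal sketch of that idea, not the authors' implementation. The `Pattern` class, the gene names, the thresholds, and the scoring rule (fraction of good-prognosis patterns covering a sample minus the fraction of poor-prognosis patterns covering it) are all assumptions made for illustration; the abstract does not state how the 40 patterns are combined.

```python
from dataclasses import dataclass


@dataclass
class Pattern:
    """Hypothetical pattern: interval limits on a few genes, tied to one outcome."""
    bounds: dict  # gene name -> (low, high); None means no limit on that side
    label: str    # "good" or "poor"

    def covers(self, sample):
        """Return True if the sample satisfies every limit in the pattern."""
        for gene, (low, high) in self.bounds.items():
            value = sample[gene]
            if low is not None and value < low:
                return False
            if high is not None and value > high:
                return False
        return True


def prognosis(sample, patterns):
    """Classify a sample and return the covering patterns as the explanation.

    Assumed scoring rule: share of good patterns covering the sample
    minus share of poor patterns covering it.
    """
    good = [p for p in patterns if p.label == "good"]
    poor = [p for p in patterns if p.label == "poor"]
    good_share = sum(p.covers(sample) for p in good) / len(good)
    poor_share = sum(p.covers(sample) for p in poor) / len(poor)
    score = good_share - poor_share
    reasons = [p for p in patterns if p.covers(sample)]
    if score > 0:
        return "good", reasons
    if score < 0:
        return "poor", reasons
    return "unclassified", reasons


# Made-up patterns and expression values, for illustration only.
patterns = [
    Pattern({"gene_a": (None, 0.2), "gene_b": (0.5, None)}, "good"),
    Pattern({"gene_a": (0.6, None)}, "poor"),
]
sample = {"gene_a": 0.1, "gene_b": 0.9}
label, reasons = prognosis(sample, patterns)
print(label, [p.bounds for p in reasons])
```

Returning the covering patterns alongside the label mirrors the "individualized explanation" mentioned in the conclusion: each prediction can be traced to the specific gene limits that the patient's sample satisfies.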

Springer - Publisher Connector

PubMed Central

Breast cancer prognosis by combinatorial analysis of gene expression data

Author: David E Axelrod
Gabriela Alexe
Irina I Lozina
Michael Reiss
Peter L Hammer
Sorin Alexe
Tibérius O Bonates
Publication venue: Springer Science and Business Media LLC

Crossref

BOOLEAN SEPARATORS AND APPROXIMATE BOOLEAN CLASSIFIERS

Author: Irina I. Lozina
Peter L. Hammer

Abstract. A simple technique is proposed for associating with a binary dataset a set of synthetic variables (called Boolean separators), some of which, if used either alone or in conjunction with the original variables, can enhance the accuracy of various frequently used machine learning and data mining methods. An iterative application of this technique is proposed for the generation of approximate Boolean classifiers, which are shown to increase the accuracy of each of the examined classification methods on each of the examined benchmark datasets.

CiteSeerX